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OPTIMIZATION WITH UNKNOWN 
OBJECTIVE FUNCTION 

DESCRIPTION 
BACKGROUND OF THE INVENTION 

5 Field of the Invention 

The present invention generally relates to the field of nonlinear 
optimization and, more particularly, to resource allocation, as for example, 
buffer pool optimization in computer database software where only the 
marginal utility is known. 

1 0 Background Description 

Conventionally, resource allocation maximizing some utility function 
using nonlinear optimization requires knowledge of the utility as a function of 
the allocated resources. For example, the resources could be the memory 
allocated to a series of computer processors and the utility is the aggregate 
15 performance of these processors or the reduction in power consumption of 

these processors. In prior art methods for solving this problem, knowledge of 
this utility function is required. Almost all prior art algorithms for solving this 
problem can be classified into one of the following three classes: 

1. Only the utility function/is known, i.e., at each point, there is a 

20 procedure to evaluate / (x). Examples of such algorithms include the 

simplex method and genetic algorithms. 

2. Only the utility function / and the gradient Vfare known. Examples 
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include steepest descent, conjugate gradient and Quasi -Newton 
methods. 

3. The utility function f 9 its gradient Vfand its Hessian V 2 f are known. 

Examples include Newton's method. 
Thus, in prior art algorithms, knowledge of the utility function is required in 
order to solve the optimization problem. In some applications, knowledge of 
this utility function is absent, and prior art iterative optimization algorithms 
cannot be used to solve this problem. 

As a specific example, buffer pool optimization in International 
Business Machines' (IBM's) DB2 computer database software is a problem 
where only the gradient of the utility function is known, i.e., at each point we 
know the benefit in terms of saved disk access time due to allocating more 
memory, but the utility, which in this case is the total saved disk access time 
due to current buffer pool setting, is unknown. 

Therefore, there is a need in the art for a method to allocate resources 
maximizing such utility using algorithms which do not need to know the 
utility as a function of the resources. 

SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to provide a way to 
maximize the utility of an allocation of resources without the need to know the 
utility as a function of the allocated resources. 

In a general form, the problem can be formulated mathematically as 
follows. The object is to maximize the utility function / (x) subject to 
constraints on the vector*. Each vector x denotes a particular allocation of 
resources and the constraints generally describe properties of the resources 
which must be satisfied, for instance, the total number of resources must be 
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less than a certain number, etc. 

The present invention provides a method for allocating resources by 
specifying an allocation x which maximizes / (x) while satisfying the 
constraints on x where the steps of the method do not require knowledge of f. 
5 Sometimes the goal is to minimize some function fin which case we 

refer to/as a cost function rather than a utility function. In the nonlinear 
programming literature, /is usually called an objective function. 

In one embodiment of the present invention, a method for allocating 
resources (i.e., determining x) may comprise the steps of starting from an 
10 initial allocation, calculating the marginal utility of said allocation, calculating 

the constraint functions of said allocation, and applying this information to 
obtain a next allocation and these steps are repeated until a stopping criteria is 
satisfied, in which case a locally optimal allocation is returned. 

According to the invention, we provide algorithms which are used to 
15 solve the above nonlinear programming problem where 

1 . only the gradient is known, or 

2. only the gradient V/ and the Hessian V 2 / are known, 

i.e., there is no procedure to evaluate the objective function / These two cases 
are optimization problems which have not been considered before. A simple 
20 example of this problem is when / = jg 9 where g is an explicit function whose 

integral cannot be written in closed form. 

Our algorithms are derived from prior art optimization algorithms, but 
modified to not need evaluation of the objective function / 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 The foregoing and other objects, aspects and advantages will be better 

understood from the following detailed description of a preferred embodiment 
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of the invention with reference to the drawings, in which: 

Figure 1 is a block diagram which illustrates a data processing system 

in which a preferred embodiment of the invention is implemented; 

Figure 2 is a graph illustrating the buffer pool optimization achieved 
5 by the invention for an example utility function and example constraint 

functions; 

Figure 3 is a flow diagram illustrating the scalar line search 
implemented in a preferred embodiment; and 

Figure 4 is a flow diagram illustrating the method as implemented in a 
1 0 preferred embodiment. 



DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT OF THE INVENTION 



Referring now to the drawings, and more particularly to Figure 1, there 
is shown in block diagram form a data processing system in which the 

15 invention has been implemented. This system comprises a DB2 Universal 

Database (UDB) server 100 which is a relational database management 
system. The UDB server 100 serves multiple DB2 clients 200 through DB2 
agents 101, 102, 109. The DB2 server has a plurality of disks 111, 112, 
1 19. Part of the data from the disks are stored in the memory, e.g., buffer 

20 pools 121 to 125, for fast access. DB2 performance is largely impacted by 

memory configuration. A proper configuration can put the right data (e.g., 
table, index) in the memory and thus reduce costly disk I/O time. The 
objective is to maximize the total saved disk time from all memory 
components (e.g., buffer pools). However, the total saved disk time cannot be 

25 easily measured. Instead, a benefit reporter 130 can be built to report the 

gradient information of the total saved disk time, so-called "benefit". The 
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benefit reporter 130 simulates a certain amount of memory increase (shown as 
simPages in Figure 2) and computes the possible saved disk time (shown as 
savedTime in Figure 2) due to this increase. The ratio of saved disk access 
time over the memory increase is reported as the "benefit" or the "benefit per 
page". (Refer to Figure 2 for more details on the difference between saved 
disk time and benefit.) The optimizer 140 operates based on this benefit 
information for resizing buffer pools to achieve optimal performance. 

Figure 2 is a graph showing the total saved disk time as a function of 
buffer pool size. The optimizer operates based on the "benefit" data - 
simulated saved disk access time per simulated-increasing memory page per 
memory component. The Karush-Kuhn-Tucker optimaltality condition for this 
problem specifies that the benefits are equal for all memory components 
which are not at the bounds d t at the point where the total saved disk time 
from all memory components is maximal and thus a possible tuning objective 
is to equalize the benefits from all memory components when possible. The 
plot in Figure 2 illustrates an example of the nonlinear relationship between 
saved disk time and memory size for the buffer pool. The saved disk time is 
increasing when the memory size is increasing, and becomes saturated when 
the memory is large enough to hold the whole table(s). This nonlinear 
relationship can be modeled by an exponential function. The tuning objective 
is to maximize the total saved disk time from all memory components, given 
the constraint that the total memory is finite. However, the saved disk time is 
difficult to measure because once the data is in the memory, the corresponding 
disk access time does not exist. Instead, a gradient reporter can be built to 
simulate increasing memory pages and to estimate the corresponding 
simulated saved disk time. This results in the approximated gradient - so 
called "benefitPerPage". 
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The present invention comprises a method, computer program product 
and system for resource allocation when the utility function is unknown and 
the marginal utility is known. The marginal utility with respect to a particular 
resource is the additional incremental utility obtained by incremental increase 
5 of that resource and is equivalent to the gradient of the utility function f(x). 

Examples of the utility are total time saved, number of transactions processed, 
power consumption saved, processors utilization, memory utilization, etc. 

All prior art iterative nonlinear optimization algorithms assumes 
knowledge of the utility function / and if available, the gradient V£ and the 
10 Hessian V 2 / 

In general, the problem is given as: 

max / (x) subject to constraints. 

X 

By replacing/ with -f 9 the problem is written as a minimization problem 
which is the standard form in nonlinear programming: 
15 min / (x) subject to constraints. 

X 

In this case we will call / the cost function rather than the utility 
function. Some examples of cost are total disk I/O (input/output) time, power 
consumption and overall system response time. The present invention 
provides a method for solving the minimization problem above without the 
20 need to evaluate / This is useful in applications where evaluating/is 

impossible, very expensive, time-consuming and/or inaccurate. An example 
of such /is when / = \g, where g is a function whose integral cannot be written 

in closed form, e.g., g = g x ~ . 

* 

Good text on nonlinear programming and numerical optimization 
25 include: Practical Optimization by P. Gill et al., Academic Press, 1981, 

Practical Methods of Optimization by R. Fletcher, Wiley & Sons, 1987, and 
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Numerical Optimization by J. Nocedal and S. Wright, Springer, 1999. 

We employ modification of line search based algorithms such as 
steepest descent, conjugate gradient, quasi-Newton or Newton's method. In 
these line search based algorithms, almost all components of the algorithm do 
not require evaluation of / The stopping criteria based on first or second order 
optimaltality conditions also do not require evaluation of/ The only place 
where evaluation of/is required is in the scalar line search component where 
a one-dimensional optimization is performed along a particular direction p. 
Our invention is to use a scalar line search method that does not require 
evaluation of/ For instance, in prior art backtracking line search along a 
search direction p, the step size a is decreased until the sufficient decrease or 
Armijo condition is satisfied: 

/ (x+ap) < f (x) +c x aVf (x) T p 

where A T denotes the transpose of the matrix (or vector) A. This requires 
evaluation of the function / The disclosed invention uses a backtracking line 
search that decreases the step a until the following condition is satisfied: 

(V/ (x+ap) - c x Vf{x)) T p < 0 (1) 

This results in a line search which uses only gradient information. Using this 
line search with the rest of the components in a prior art line search type 
nonlinear programming method results in a nonlinear programming method 
which uses only gradient information, and if available, Hessian information, to 
minimize / and does not require evaluation of /itself 

This is illustrated in Figure 3 which is a flow diagram of the scalar line 
search method in a preferred embodiment of the invention. The process begins 
with an initial value of a at input block 301. A determination is made in 
decision block 302 as to whether Equation (1) is satisfied. If not, the value of 
a is reduced in function block 303, and then a return is made to decision block 
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302. The reduction of a in function block 303 can be done by multiplying a 
by a constant c 2 < 1. Some typical values for c, and c 2 are c, = 0.0001, c 2 = 0.7. 
When Equation (1) is satisfied, as determined in decision block 302, the step 
size a is returned in output block 304. The step size a is then used by the 
5 optimization algorithm to compute the next solution point x + ap. Note that 

when /is convex, the condition in Equation (1) implies the Armijo condition 
and thus in this case convergence properties of prior art methods using the 
Armijo condition remain true for the disclosed method. 

In a preferred embodiment, the resources are a series of memory 
10 buffers in a computing system running the database software DB2 and the goal 

is to minimize access time by allocating memory to these buffers 
appropriately. This is illustrated in Figure 1. The allocation is given by a 
vector x = (x,, . . ., x n ) and the problem is to maximize / (x) subject to the 
constraint x. < c and x i > d r 

15 The utility function / (x) is the total disk time avoided by using the 

allocation vector x, whereas the partial derivative — is the time saved by 

dx f 

allocation more resources to x,-. This is illustrated in Figure 2. 

The gradient consists of the partial derivatives of f 9 and corresponds 
to the benefit in access time saved due to increased memory buffer allocation. 
20 The system configuration is such that is readily available, whereas /is not 

available (or requires too much effort to compute accurately). 

By the Karush-Kuhn-Tucker optimaltality condition, at the optimal 

point x = (x, , . . ., x„ ) where/is maximized, the gradient Vf satisfies 

V/(**) f = A,ifx/ >d i 
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V/(x*) f . < A, if */ = rf, 

for some real number A. To solve this problem where the inequality 
constraints are linear, which is a special case of the general problem, an active 
set line search algorithm can be used. 
5 A flowchart of such an algorithm is shown in Figure 4. A 

determination is made in decision block 401 as to whether the solution has 
converged. If not, a determination is next made in decision block 402 as to 
whether to delete a constraint from the working set. If so, a constraint is 
deleted from the working set in function block 403 and a return is made to 

10 decision block 401 ; otherwise, a feasible search direction is computed in 

function block 404. Then, in function block 405 a step length is computed via 
a line search. If necessary, a constraint is added to the working set in function 
block 406, and the solution is updated in function block 407 before a return is 
made to decision block 401. An output is provided when the solution has 

15 converged. More details can be found in Section 5.2.1 in the text by Gill et al., 

ibid. In the preferred embodiment, the modified backtracking line search 
described above is used to execute block 405 in Figure 4. Note that we need to 
replace/ with -fin Equation (1) as Equation (1) is given for minimizing/, 
whereas in the case of the buffer pool optimization, the goal is to maximize/ 

20 Such a change is common and known by those skilled in the art. 

Most line search based nonlinear optimization algorithms can be 
modified in the same manner by those skilled in the art to obtain a 
corresponding algorithm that does not require evaluation of / For example, for 
more general constraints, line search based methods such as penalty function 

25 methods, augmented Lagrangian methods, and sequential quadratic 

programming methods can be used, where the scalar line search component in 
these methods is replaced by the modified line search described above. 
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Furthermore, since the utility function /in Figure 2 is separable, techniques for 
exploiting separability, and more generally partial separability and group 
partial separability (see Chapter 9 of Nocedal and Wright, ibid.) can be used. 
While the invention has been described in terms of a single preferred 
5 embodiment, those skilled in the art will recognize that the invention can be 

practiced with modification within the spirit and scope of the appended 
claims. 
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